A Comparison of Approaches for Learning Probability Trees

Authors

  • Daan Fierens
  • Jan Ramon
  • Hendrik Blockeel
  • Maurice Bruynooghe
Abstract

Probability trees (or Probability Estimation Trees, PETs) are decision trees with probability distributions in the leaves. Several alternative approaches for learning probability trees have been proposed, but no thorough comparison of these approaches exists. In this paper we experimentally compare the main approaches using the relational decision tree learner Tilde, on both non-relational and relational datasets. In addition to the main existing approaches, we also consider a novel variant of an existing approach based on the Bayesian Information Criterion (BIC). Our main conclusion is that, overall, trees built using the C4.5-approach or the C4.4-approach (C4.5 without postpruning) have the best predictive performance. If the number of classes is low, however, BIC performs equally well. An additional advantage of BIC is that its trees are considerably smaller than trees for the C4.5- or C4.4-approach.
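
As a rough illustration of the two ideas named in the abstract, the sketch below computes a relative-frequency class distribution for a leaf and the textbook BIC score (-2 ln L + k ln n) for a set of leaves. It is not the paper's BIC variant or Tilde's implementation: the function names are hypothetical, and counting (number of classes - 1) free parameters per leaf is an assumption made here for illustration.

```python
import math
from collections import Counter


def leaf_distribution(labels, classes):
    """Relative-frequency class distribution for the examples in one leaf."""
    counts = Counter(labels)
    n = len(labels)
    return {c: counts[c] / n for c in classes}


def log_likelihood(leaves, classes):
    """Log-likelihood of the training labels under the per-leaf distributions."""
    total = 0.0
    for labels in leaves:
        if not labels:  # an empty leaf contributes nothing
            continue
        dist = leaf_distribution(labels, classes)
        for y in labels:
            # dist[y] > 0 because y occurs in this leaf
            total += math.log(dist[y])
    return total


def bic(leaves, classes):
    """Textbook BIC: -2 ln L + k ln n (lower is better).

    Assumption: each leaf has (number of classes - 1) free parameters.
    """
    n = sum(len(labels) for labels in leaves)
    k = len(leaves) * (len(classes) - 1)
    return -2.0 * log_likelihood(leaves, classes) + k * math.log(n)
```

With this parameter count, the penalty term grows with both the number of leaves and the number of classes, so a split lowers the score only when the gain in likelihood outweighs the extra parameters.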

Similar resources

A Comparison of Approaches for Learning First-Order Logical Probability Estimation Trees

Probability Estimation Trees (PETs) [9] try to estimate the probability with which an instance belongs to a certain class, rather than just predicting its most likely class. Several approaches for learning PETs have been proposed, mainly in a propositional context. Since we are interested in applying PETs in a relational context, we make some simple modifications to the first-order tree learner...

How Students’ Views on Educational Factors Influence Their Achievement Motivation and Learning Approaches? Comparison of Perspectives

This comparative study was conducted to explore achievement motivation and learning approaches of agricultural students and to examine students’ views on educational factors influencing their achievement motivation and learning approaches. The statistical population of this study comprised agricultural students of Tehran University (Tehran, Iran) and Ghent University (Belgium). A sample of 89 a...

Application of Machine Learning Approaches in Rainfall-Runoff Modeling (Case Study: Zayandeh_Rood Basin in Iran)

Runoff resulting from rainfall is the main source of water in most parts of the world. Therefore, predicting the runoff volume resulting from rainfall is becoming increasingly important for the control, harvesting, and management of surface water. In this research a number of machine learning and data mining methods including support vector machines, regression trees (CART algorithm), model tree...

Reachability checking in complex and concurrent software systems using intelligent search methods

Software system verification is an efficient technique for ensuring the correctness of a software product, especially in safety-critical systems in which a small bug may have disastrous consequences. The goal of software verification is to ensure that the product fulfills the requirements. Studies show that the cost of finding and fixing errors at design time is less than finding and fixing the...

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption of most machine learning algorithms is that the training set (source domain) and the test set (target domain) follow the same probability distribution. However, in most of the real-world application...

Publication date: 2005